Research on a Lip Reading Algorithm Based on Efficient-GhostNet
نویسندگان
چکیده
Lip reading technology refers to the analysis of visual information speaker’s mouth movements recognize content speech. As one important aspects human–computer interaction, lip has gradually become popular with development deep learning in recent years. At present, most networks are very complex, large numbers parameters and computation, model generated by training needs occupy memory, which brings difficulties for devices limited storage capacity computation power, such as mobile terminals. Based on above problems, this paper optimizes improves GhostNet, a lightweight network, it proposing more efficient Efficient-GhostNet, achieves performance improvement while reducing number through local cross-channel interaction strategy, without dimensionality reduction. The improved Efficient-GhostNet is used perform spatial feature extraction, then extracted features inputted GRU network obtain temporal sequences, finally prediction. We Asian volunteers recording dataset paper, also adopting data enhancement dataset, using angle transformation deflect process recorder 15 degrees each left right, order be able enhance robustness better reduce influence other factors, well improve generalization ability so that can consistent recognition scenarios real life. Experiments prove + achieve purpose comparable accuracy.
منابع مشابه
An Efficient Lip-reading Method Using K-nearest Neighbor Algorithm
Many studies have been carried out on lip reading, most of those works are based on color images, while some essential features might not be obtained, like inner lip information. In this paper, RGBD camera will be introduced for improving the recognition rate of lip reading. We try to complete lip reading through using only gray-scale images. Thirteen groups of words are given, and we present e...
متن کاملLip-reading based on a fully automatic statistical model
In this paper, we describe audiovisual automatic speech recognition experiments carried using visual parameters extracted from “natural” images. Unlike many other experiments in the AV ASR field, these visual parameters are obtained without any hand-labeling phase and are naturally noisy, due to the extraction process. We evaluate our models with different strategies among which : use of a shap...
متن کاملthe effect of genre-based teaching on reading comprehension of literary texts
تحقیق حاضر به بررسی کاربرد روش ژانر-محور را در محیط آموزش زبان عمومی می پردازد.روش ژانر-محور به زبان آموزان کمک میکند که در زمینه خوانش پیشرفت کنند. بعضی از محققین معتقد اند که روش تدریس ژانر-محور به تدریج به زبان آموزان کمک می کند تا در درک ژانر های مختلف مهارت یابند (هایلند 2004).همچنین امروزه توجه روز افزونی به اهمیت استفاده از ادبیات در برنامه آموزشی زبان انگلیسی (esl/efl ) شده است. زمانی ک...
15 صفحه اولResearch on Color Watermarking Algorithm Based on RDWT-SVD
In this paper, a color image watermarking algorithm based on Redundant Discrete Wavelet Transform (RDWT) and Singular Value Decomposition (SVD) is proposed. The new algorithm selects blue component of a color image to carry the watermark information since the Human Visual System (HVS) is least sensitive to it. To increase the robustness especially towards affine attacks, RDWT is adopted for its...
متن کاملa study on the effectiveness of textual modification on the improvement of iranian upper-intermediate efl learners’ reading comprehension
این پژوهش به منظور بررسی تأثیر اصلاح متنی بر بهبود توانایی درک مطلب زبان آموزان ایرانی بالاتر از سطح میانی انجام پذیرفت .بدین منظور 115 دانشجوی مرد و زن رشته مترجمی زبان انگلیسی در این پزوهش شرکت نمودند.
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Electronics
سال: 2023
ISSN: ['2079-9292']
DOI: https://doi.org/10.3390/electronics12051151